Unit 9 


Matrices 


Introduction 


This unit introduces mathematical objects called matrices (the singular 
form is matriz). At their simplest, matrices are arrays of numbers used to 
hold information in an orderly way. In particular, they provide a useful 
way of storing data that can be organised in rows and columns. Everyday 
examples of such data include transport timetables and tables holding 
survey results. 


In this unit you will discover that arrays of numbers can be thought of as 
‘multi-dimensional numbers’. For example, you can add, subtract and 
multiply them, just as you can add, subtract and multiply numbers. 
You’ve met a similar idea before: in Unit 5, you saw that column vectors 
are numerical entities made up of more than one number, and they can be 
added, subtracted and multiplied by scalars, for example. 


One advantage of arranging large amounts of numerical data in matrix 
form is precisely that a block of data can be treated as a single entity. 
Many problems that involve large amounts of numerical data are first 
reduced to matrix problems before being solved with a computer. So 
matrices are an important topic in mathematics, and find widespread use 
in mathematical modelling, as well as in computer graphics (where they 
are used to move and rotate objects), medical imaging, economic models, 
electrical networks, data encryption and many other applications. 


As a simple practical example, consider a publisher that supplies its books 
to a number of bookshops. The publisher’s sales department might collect 
the weekly order quantities for three of its bestselling books in a grid such 
as the one in Table 1. 


Table 1 A publisher’s order quantities for three books in a certain week 


Bestseller 1 | Bestseller 2 | Bestseller 3 
Bookshop 1 10 25 12 
Bookshop 2 5 10 5 
Bookshop 3 7 5 0 
Bookshop 4 10 8 10 


The information in Table 1 can be stored more succinctly by omitting the 
row and column headings and enclosing the data in brackets, if it is 
understood that each row corresponds to a bookshop and each column 
corresponds to a book: 


10 25 12 
5 10 5 
T 5 0 
10 8 10 


This is an example of a matrix. Information about order quantities can be 
collected every week for the life of the three books and stored in this form. 
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For example, the information for two consecutive weeks can be recorded in 
two separate matrices: 


10 25 12 15 30 20 
5 10 5 10 20 10 
T 5 OY?’ 20 8 5 
10 8 10 15 0 10 


The order quantities for a fortnight can now be calculated by adding 
corresponding numbers in the two arrays, which gives 


10+15 25+30 12+ 20 25 55 32 
5+10 10+20 5+10 that is 15 30 15 
7+20 5+8 0+5 |’ í 27 13 =«5 
10+15 8+0 10+ 10 25 8 20 


This calculation amounts to adding the two matrices, in a sense that will 
be made precise in Section 1. 


You might wonder what the advantage is in carrying out calculations in 
this way. After all, the total number of orders for each bookshop can be 
worked out in a straightforward way without resorting to matrices. In a 
real-life situation, however, a large publisher will supply hundreds of 
bookshops with hundreds of books. Calculating order quantities and total 
charges requires careful accounting, and it is precisely in this sort of 
situation that matrices become a useful way to store data (especially 
electronically) and carry out calculations with them. 


Matrices were first introduced in Western mathematics in their 
current form by the British mathematician Arthur Cayley 
(1821-1895). However, they appeared as early as 300-200 BC in the 
Chinese text The nine chapters on the mathematical art, where 
matrix methods are used to solve simultaneous equations. You’ll meet 
a modern version of these methods in Section 4. 


There are other useful operations that can be carried out when data are 
stored in matrix form, such as the multiplication of matrices by single 
numbers, and the multiplication of two matrices. You'll learn about these 
operations in Section 1, and see an application of matrix multiplication in 
Section 2. In Section 3 you'll learn about matrix inverses, and in Section 4 
you'll meet a connection between matrices and systems of linear equations. 


1 Matrices and matrix operations 


1 Matrices and matrix operations 


In this section you'll meet matrices and standard matrix notation, and 
learn about matrix addition, multiplication of matrices by single numbers 
(known as scalar multiplication of matrices), matrix multiplication and 
matrix powers. 


1.1 Matrix notation and terminology 


A matrix (pronounced ‘may-tricks’) is a rectangular array of numbers, 
usually enclosed in brackets. Here are some examples: 


3 A 

(5 2 6) € 1) 2 g 

5 3 0 3 1 2 0 
Some texts use square brackets for matrices, like this: 


il 


In this module we’ll always use round brackets. 


We often use capital letters to denote matrices. For example, we might 
write 


1 
= 4 . : 
1 2 -l 1 2 3 First rule of consulting: never, 
ne € 3 3) 2 ( et T 6]: never call this a “table”. 
22 0 We can bill four times as much 


In the text in this module, the capital letters that denote matrices are in when we call it a “matrix”. 


bold font, but you don’t need to do anything special when you handwrite 
them. In particular, you don’t need to underline them, unlike letters that 
denote vectors. 


The numbers in a matrix are called its elements (or its entries, in some 
texts). For example, matrices A and C above each have six elements, and 
matrix B has four elements. In this module, the elements of matrices are 

always real numbers. 


A horizontal line of numbers in a matrix is called a row, and a vertical line 
of numbers is called a column. For instance, in the matrix A above the 
first row is (1 2 —1) and the second row is (5 3 0). Likewise, the first 


column is Gs the second column is (5) and the third column is (F 


The matrix A has two rows and three columns: we say that it is a 2 x 3 
matrix. In general, a matrix with m rows and n columns is described as an 
m x n matrix, or a matrix of size m x n. The first number in this 
notation is always the number of rows, and the second number is always 
the number of columns. The size m x n of a matrix is read as ‘m by n’. 


A matrix with the same number of rows as columns is called a square 
matrix. Thus, for example, matrix B above is square. 
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The column notation for vectors that you met in Unit 5 is a particular 
instance of matrix notation: two-dimensional column vectors are 2 x 1 
matrices, and three-dimensional column vectors are 3 x 1 matrices. In 
general, the word vector is used to mean any matrix with a single column, 
even if it has more than 3 elements and hence has no geometric 
interpretation in the plane or three-dimensional space. For instance, the 
following matrices are vectors: 


V2 
—3 
, where w, x, y, z are variables. 


xe REg 


The elements of a vector are often called its components, as in Unit 5. 
A vector with n components is called an n-dimensional vector. 


There is a useful notation for the elements of a matrix in terms of their 
row and column positions. The element in row i and column j of a 
matrix A is denoted by aij. Similarly, the element in row 7 and column j 
of a matrix B is denoted by b;;, and so on. 


For instance, if 


1 
kale 3 
7 -<26)° 


then aj, = $, a12 = 4, a21 = 7 and a22 = —6. 


This notation is used for matrices of any size. Thus, for instance, a general 
3 x 4 matrix can be denoted by 


a11 a12 @13 414 
A = | a2 a22 a23 az 
a31 432 Q33 G34 


Activity 1 Using matrix notation 


Write down the elements a33, a34, 443, Q14, @44 and a42 of the matrix 


2 8 O -1 


4 7 5 -2 
eE 9 -l 3 =4 
60—59 =i =9 


What is the size of A? 


Two matrices are equal if they have the same numbers of rows and 
columns, and all corresponding elements are equal. For example, 


G i)=G 4) 


1 Matrices and matrix operations 


but 


i. 
12 1 2 1 3 
(As 4} om GG 4G a) 


You can carry out operations on matrices in a similar way to numbers. For 


example, you can add, subtract and multiply them, as youll see in this 
section. There’s also a way to manipulate matrices that in certain 
circumstances plays the role of division, which you’ll learn about in 


Section 3. While matrix addition is similar to addition of vectors, which 
you met in Unit 5, matrix multiplication is more surprising. We’ll start by 


looking at matrix addition. 


Arthur Cayley gave the rules for matrix operations in a paper 


presented to the Royal Society in 1858. The paper also contained the 


conditions under which a matrix has an inverse (you'll meet matrix 
inverses in Section 3). However, it was Cayley’s friend and fellow 


mathematician James Joseph Sylvester who coined the term ‘matrix’, 


in 1850, and he also contributed to the theory of matrices. 


Before being appointed to a mathematics professorship at the 
University of Cambridge, Cayley worked for 14 years as a lawyer in 


London. At the time, Sylvester was employed as an actuary. The two 


worked together at the courts of Lincoln’s Inn, which gave them the 
opportunity to discuss mathematics. 


Sylvester had a more varied mathematical career than Cayley, 


including professorships at University College London, Johns Hopkins 


University in the USA, and the University of Oxford. It is said that 


he tutored Florence Nightingale in mathematics, though firm evidence 


for this has yet to be found. 


1.2 Matrix addition and subtraction 


You can add two matrices only if they have the same size. The sum of two 
matrices is obtained by adding their corresponding elements. For example: 


—2 4 3 il —2+3 441 k 5 
1 5ļ]+{-2 10] ={1+(-2) 5+10ļ]=ļ|-1 15 
0 -3 -6 4 0+(-6) -3+4 —6 1 


The rule for adding two matrices can be expressed formally as follows. 


Arthur Cayley 
(1821-1895) 


James Joseph Sylvester 
(1814-1897) 
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Matrix addition 


If A and B are m x n matrices, then A + B is the m x n matrix 
whose element in row 7 and column j is aij + bij. 


Addition of two matrices of different sizes is not defined. 


For instance, you cannot add a 3 x 3 matrix and a 3 x 5 matrix. 


The rule for adding matrices extends to the sum of any number of matrices 
of the same size: you just add corresponding elements. 


Activity 2 Adding matrices 


For each of the pairs of matrices below, work out the matrix sum A + B. 


wa-@ 2). ae 9 


In Unit 5, you saw that vector addition shares many properties with the 
addition of numbers. This happens for matrix addition, too. For example, 
you can see from the definition of matrix addition that 


A+B=B+A, 


for all matrices A and B of the same size. As you saw with vectors, this 
property is expressed by saying that matrix addition is commutative. 


You can also see from the definition of matrix addition that 
(A+B)+C=A+(B+OC), 


for all matrices A, B and C of the same size. In other words, when you 
add three matrices A, B and C of the same size, it doesn’t matter whether 
you add A and B and then add C to the result, or add B and C and then 
add A to the result, as you get the same overall result either way. This 
property is expressed by saying that matrix addition is associative. It 
tells you that you can write the expression 


A+B+C 


without ambiguity: it doesn’t matter whether it means (A + B) + C or 
A + (B + C), as both are equal. 


There are other respects in which matrix operations behave like number 
operations. You will discover more similarities between matrices and 
numbers as you meet other matrix operations. 


1 Matrices and matrix operations 


You can also subtract matrices of the same size by subtracting their 
corresponding elements. For example, 


4 5 3 13 4\ (4-1 5-3 3-4 

7 -6 0) (2 9 1) \7-3 =—6-9 0-1 
B 5 =i 
Tia i aed 


Here is the formal definition of matrix subtraction. 


Matrix subtraction 


If A and B are m x n matrices, then A — B is the m x n matrix 
whose element in row 7 and column j is aij — bij. 


Subtraction of two matrices of different sizes is not defined. 


Like the subtraction of numbers, matrix subtraction is neither 
commutative nor associative. For example, it is not true that 

A —B=B- A for all matrices A and B of the same size, as illustrated 
by parts (a) and (b) of the next activity. 


Activity 3 Subtracting matrices 


_ A — 
Let A = A ,B= 2 and C = 0 =g . Work out the 
1 0 ţi 3 aa 0 


following matrices. 
(a) A-B b) B-A (c) B-C 


Since vectors are single-column matrices, they are added and subtracted 
by adding or subtracting corresponding components, as in the examples 
below. 


1 7 1+7 8 
= =ý -3 + (—2) 5 
4} + 1] = 4+1 = 5 
1 3 1 3 
3 3 3+5 2 
0 =l Ei =i 
10 -= i= (2) 12 
apf St..] 3-3 | 7 o 
1 a ee) ee ee en eee | 
=i 1 =i i = 


In other words, the ideas of addition and subtraction of column vectors 
that you met in Unit 5 extend in the obvious way to vectors with more 
than three components. 
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A matrix each of whose elements is zero is called a zero matrix, and is 
usually denoted by 0. For example, the 2 x 2 zero matrix is 


o= (2 2) 


Also, as you’d expect, if A is any matrix, then the matrix formed by 
changing each element of A to its negative is called the negative of A, 
and denoted by —A. For example, 


0 1). (U0 =1 
SM! ATN Aj“ 
It follows from the definitions of matrix addition and subtraction that for 
any matrices A and B of the same size, 
A-B=A+(-B), 


as you’d expect. 


1.3 Scalar multiplication of matrices 


Any matrix can be added to itself, and a matrix sum A + A can be 
written as 2A. Each element in 2A is twice the corresponding element 
in A, as in the following example: 


1 05 1 05 2 0 10 
-i 22 S$) 271. = S) =) 0 =4 6 
3 ł gf 3 + 1l 6 1 2 


Similarly, adding A to itself n times, for any natural number n, gives a 
matrix in which each element is the corresponding element in A multiplied 
by n. 


This idea can be generalised to any real number k, so kA is the matrix 
obtained by multiplying each element of A by k. For example, 


at i) = 5x4 4x3 R 3 
2 \0 =1 5x0 5x (-1) 0-3) 
The matrix kA is called the scalar multiple of the matrix A by the real 


number k, and the operation of forming kA is called scalar 
multiplication. 


Scalar multiplication 


If A is an m X n matrix and k is any real number, then kA is the 
matrix whose element in row i and column j is kaij. 


The number k in a scalar multiple kA of a matrix A is sometimes referred 
to as the scalar k in this context, to emphasise that it’s a number rather 
than a matrix. 


1 Matrices and matrix operations 


Activity 4 Multiplying matrices by scalars 


(a) Calculate the following products of scalars and matrices, simplifying 
your answers. 


(i) (5 i (ii) Te =) (iii) 5 (3 l 


(b) Simplify each of the following matrices by writing it as a scalar 
multiple of a matrix with integer elements. 


i) : a Gi) |) (iii) G sr) 


Notice that it follows from the definition of scalar multiplication that, for 
any matrix A, 


as you’d expect. 


In the next activity you can practise the matrix operations that you’ve 
learned so far. 


Activity 5 Combining matrix addition, subtraction and scalar 
multiplication 


+ 12 1 i 2 0 -1 
Let A = | ? ,B=| ? andC={ ? À 
4 —2 0 =2. 2 -1 3 5 


Where possible, evaluate A — C, B+ 2A and C — 2A. 


Here’s a summary of some useful properties of matrix addition and scalar 
multiplication of matrices. As you’d expect (since vectors are single-column 
matrices), these properties are the same as the properties of vector algebra 
that you met in Unit 5, but with vectors replaced by matrices. 
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Properties of matrix addition and scalar multiplication 


The following properties hold for all matrices A, B and C for which 
the sums mentioned are defined, and for all scalars m and n. 


A+B=B+A 
(A+B)+C=A+(B+C) 
A+0=A 

A+(—A)=0 
m(A+B)=mA+mB 
(m+n)A=mA+4+nA 
m(nA) = (mn)A 

1A=A 


Co aS ee ÇN pS Se ee 


n 


In properties 3 and 4, O is the zero matrix of the same size as A.) 


1.4 Matrix multiplication 


In Subsection 1.3 you learned how to multiply a number and a matrix. 
Under certain conditions, it is also possible to multiply two matrices. The 
way that this is done is less obvious than matrix addition or scalar 
multiplication. 


Recall the example in the introduction to this unit, where a matrix was 
used to store the numbers of copies of three bestselling books that four 
bookshops ordered from a publisher in a certain week: 


Bestseller 1 Bestseller 2 Bestseller 3 


Bookshop 1 10 25 12 
Bookshop 2 5 10 5 
Bookshop 3 7 5 0 
Bookshop 4 10 8 10 


Now suppose that the publisher charges the bookshops £4 per copy for 
Bestseller 1, £7 per copy for Bestseller 2 and £9 per copy for Bestseller 3. 
According to the data in the matrix, Bookshop 1 ordered 10 copies of 
Bestseller 1, 25 copies of Bestseller 2 and 12 copies of Bestseller 3. 
Therefore the total amount, in £, that Bookshop 1 owes the publisher for 
that week’s order is 


10x 44+25x 7 + 12 x 9 = 323. 


The expression on the left of this equation is obtained by multiplying each 
element in the first row of the matrix by the price of the corresponding 
book, and then adding the results. 


1 Matrices and matrix operations 


If we write the prices (in £) for the three bestsellers as a three-dimensional 
vector, that is, as a 3 x 1 matrix, then we can picture this procedure as 
shown in Figure 1. 


gives 10x 44+ 25x 7+12x9 


Figure 1 The procedure for calculating the amount owed by Bookshop 1 


If we want to calculate the amount owed by Bookshop 2, then we can carry 
out the same procedure with the second row of the matrix, and likewise for 
Bookshops 3 and 4 with the third and fourth rows, respectively. We can 
write the four amounts that we obtain as a four-dimensional vector. The 
procedure described above combines the two matrices 


10 25 12 


oO ae and : 

7 5 O 9 

10 8 10 

to give 

10x 4+25x7+12x9 323 
5x44+10x7+5x9 _ | 135 
7x4+5x7+0x9 ~ | 63 
10x4+8x7+10x9 186 


This calculation shows that the amounts owed by the four bookshops are 
£323, £135, £63 and £186, respectively. 


This procedure for combining matrices comes up so often in applications 
that it is used to define matrix multiplication. We say that 


10 25 12 4 323 

5 10 5 : 135 
the product of 7 5 0 and : is 63 

10 8 10 186 
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Now suppose that as well as calculating the amount owed by each of the 
four bookshops, the publisher also wants to calculate the profit made from 
sales to each individual bookshop. Suppose that the profits made on each 
copy of Bestsellers 1, 2 and 3 are £1, £2 and £2, respectively. You can use 
exactly the same procedure as above to calculate the profit made from the 
sales to each bookshop. You write the profits (in £) made on the three 
books as a three-dimensional vector, and form 


10 25 12 1 
the product of w A and 2], 
7 5 Ø 9 
10 8 10 
which is 
10x 1+25x2+12x2 84 
5x14+10x2+5x2 | | 35 
7x14+5x24+0x2 ~ | 17 
10x 1+8x2+10x2 46 


So the profits made from sales to the four bookshops are £84, £35, £17 
and £46, respectively. 


In fact you can view the two matrix multiplications above — the one for the 
amounts owed, and the one for the profits — as a single matrix 
multiplication. To do this, you put the two vectors containing the prices 
and the profits together as a single matrix with two columns, and you put 
the two vectors that contain the results of the calculations together as a 
single matrix with two columns, and say that 


10 25 12 4 323 84 
5 10 5 135 35 

the product of 7 5 0 and : ; 63 17l; (1) 
10 8 10 186 46 


This calculation is an example of how matrices are multiplied in general. 
In Section 2 you’ll meet another context that shows that it makes sense to 
multiply matrices in this way. Before that, you’ll see a more detailed 
explanation of the procedure for multiplying matrices, and have a chance 
to practise it yourself. 


Let’s start by considering which pairs of matrices can be multiplied 
together. Remember that the element in the first row and first column of 
the product matrix in calculation (1) above was obtained by multiplying 
each element in the first row of the first matrix by the corresponding 
element in the first column of the second matrix, and adding the results: 


10 x 4 +25 x 7 +12 x 9 = 323. 


It was possible to do this because the number of columns in the first matrix 
is the same as the number of rows in the second matrix, so each element in 
the first row of the first matrix has a corresponding element in the first 
column of the second matrix. You can multiply two matrices together only 
if this condition holds. 


1 Matrices and matrix operations 


Thus, you can multiply together a 4 x 3 matrix and a 3 x 2 matrix. On 
the other hand, it is not possible to multiply together a 3 x 4 matrix and a 
2 x 3 matrix. When it is not possible to multiply two matrices, we 
sometimes say that their product is undefined. 


A convenient way to determine whether two matrices, of sizes m x n and 
p x q, say, can be multiplied together is to write their sizes next to each 
other as follows: 


mxXn pXq. 


The matrices can be multiplied only if the two numbers in the middle are 
equal; that is, if n = p. If these numbers are equal, then the size of the 
product matrix is given by the remaining numbers. So the product of an 
m x n matrix and an n x q matrix is a matrix of size m x q. 


For example, a 4 x 3 matrix and a 3 x 2 matrix can be multiplied together, 
and the product has size 4 x 2. Here’s a useful way to picture this fact: 


4x{ 3 | [3 ]x2 gives 4x 2. 


Now let’s look at the procedure for multiplying two matrices step by step. 
It’s demonstrated in the next example. 
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®. To obtain the element in the first row and first column of AB, 
multiply each element in the first row of A by the corresponding 
element in the first column of B, and add the results. ® 


-2 1 3 im 0 (-2)x 1+1x $43 (-1) 
J 4 5 2| = 
0-1 0 =g 4 
T9 
2 


@®,. To obtain the element in the first row and second column of AB, 
apply the same procedure to the first row of A and the second column 
of B. & 


—2 1 3 1g -3 (-2)x0+1x2+3x4 
2 4 -2 5 2)= 
0-1 O -1 K 

g 

-3 14 


@. To obtain the element in the second row and first column of AB, 
apply the same procedure to the second row of A and the first column 
of B. ® 


D I g m 0 -3 14 
a a) m2) = a 24 (-2) x E1) 
er = 4 


9 
a 
— 6 


@. In general, to obtain the element in the ith row and jth column 
of AB, multiply each element in the ith row of A by the 
corresponding element in the jth row of B, and add the results. © 


=Æ 1i 8 1 0 
J 4 =2 5 2 
© = 0/\=-1 4 
-3 14 
— 6 2x0+4x2+(—2)x4 
0x1+(-1)x+0x(—1) 0x04+(-1)x2+0x4 
9 
—? 14 
=| 6 0 
1 
=p =) 


1 Matrices and matrix operations 


Here’s a summary of the procedure for matrix multiplication. 


Matrix multiplication 


Let A and B be matrices. Then the product matrix AB can be 
formed only if the number of columns of A is equal to the number of 
rows of B. 


If A has size m x n and B has size n x p, then the product AB has 
size m x p. 


The element in row į and column j of the product matrix AB is 
obtained by multiplying each element in the ith row of A by the 
corresponding element in the jth column of B and adding the results. 


In element notation, if ci; denotes the element in the ith row and 
jth column of AB, then 


Cig = @ibij + ainda; + +++ + Gindn;. 


Figure 2 illustrates how row 7 of a matrix A and column j of a matrix B 
are combined to give the element in row į and column j of the product 
matrix AB. 


a1; + aizbzj feeb Gin ng 
Figure 2 The element in row 7 and column j of a product matrix AB 
expressed in terms of elements from A and B 


Matrix multiplication may seem quite complicated at first, but it is a very 
useful technique, and with practice you should become proficient at it. 
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Activity 6 Multiplying matrices 


In each of parts (a)—(e) below, calculate the matrix product if it exists. 


2 3 1 3 
(a) Teir (b) (1 2 3){8 
-6 8 T 
2 9 0 10 9 
Cm | 71 4 1 (a) [Spa 2 3) 
—6 8/ \-2 7 
ie afl? i 
(e) {7 6 ,} {1 9 -1 
21 3 


Matrix multiplication shares many of the properties of multiplication of 
numbers, as you'll see later in this subsection and later in the unit. 
However, there’s an important difference between the properties for 
matrices and those for numbers, as you can find out in the next activity. 


Activity 7 Investigating matrix multiplication 


(a) Determine whether the products AB and BA are defined for the 
following pairs of matrices. 


(b) For each of parts (a)(i) and (a)(ii), if the products AB and BA are 
both defined, evaluate them. What do you notice? 


Activity 7 illustrates the important fact that matrix multiplication is not 
commutative. First, there are matrices A and B for which the product AB 
exists but the product BA is not defined. Second, even in cases where 
both the products AB and BA are defined, these two products can be 
different matrices. 


In the next activity, you can investigate whether another property that 
holds for multiplication of numbers also holds for multiplication of 
matrices. 
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Activity 8 Investigating matrix multiplication further 


Let A = (-1 0), B=(} E adc=( 3 S i) 


(a) What are the sizes of the products AB and BC? Calculate these 
matrices. 


(b) Do the products (AB)C and A(BC) exist? If so, calculate them. 
What do you notice? 


In Activity 8 you should have found that, for the particular three matrices 
A,B and C in the activity, (AB)C = A(BC). This does not of course tell 
you that this property holds for matrices in general. However, this 
property does hold in general: whenever A, B and C are matrices such 
that the products AB and BC are defined, we have 


(AB)C = A(BC). 


In other words, matrix multiplication is associative. In this respect matrix 
multiplication behaves like multiplication of numbers. 


In the next activity you can investigate how matrix multiplication 
interacts with multiplication by a scalar. 


Activity 9 Combining matrix multiplication and multiplication by a 
scalar 


Let A = a) and B = i Show that 


A (2B) = 2(AB). 


In Activity 9 you verified a particular instance of a general property of 
matrices: if the matrix product AB exists, then, for any scalar k, 


A (kB) = (kA)B = k(AB). 


In the next activity you’re asked to investigate how matrix multiplication 
interacts with matrix addition. 
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Activity 10 Combining matrix addition and multiplication 


1 —1 1 —1 
Let A= (3 Bag) and C= ( i): 
By calculating A(B + C) and AB + AC, show that 
A(B+C)=AB+AC. 


In Activity 10 you verified a particular instance of another general 
property of matrices: if the matrix product A(B + C) can be formed, then 


A(B+C)=AB+AC. 
It’s also true that if the matrix product (A + B)C can be formed, then 
(A+B)C =AC+BC. 
In other words, matrix multiplication is distributive over matrix addition. 
So, for example, if you encounter a matrix expression of the form 
AB+ AC, 
then you can factorise it to give 
AB+AC = A(B+C). 


Here’s a summary of the properties of matrix multiplication that you’ve 
seen so far. 


Some properties of matrix multiplication 


The following properties hold for all matrices A, B and C for which 
the products and sums mentioned are defined. 


e (AB)C=A(BC) 

e k(AB) = (KA)B = A(kB), for any scalar k 
e A(B+C)=AB+ AC 

e (A+B)C=AC+BC 


Remember also the following important fact. 


Matrix multiplication is not commutative 


e There are matrices A, B such that the product AB exists but 
the product BA does not. 


e Even when both products are defined, it can happen that 
AB Æ BA. 


1 Matrices and matrix operations 


Matrix powers 


Just as the square a? of a real number a is defined to be the product a x a, 
so we define the square A? of a matrix A to be the matrix product AA, 
if it exists. 


For example, if 
2 1 
a= (5 0): 


then 


Bae Go. 


In the next activity you’re asked to investigate which matrices can be 
squared. 


Activity 11 = Investigating which matrices can be squared 


1 0 3 
0 2 4 


2 =l 
0 2 


(c) Describe which matrices can be squared. 


(a) Let M = ( ) . Why is it not possible to form the product MM? 


(b) Let N = ( l Is the product NN defined? If, so, evaluate it. 


In Activity 11 you saw that if A is a square matrix (a matrix with the 
same number of rows as columns), then its square A? exists. The 
matrix A? has the same size as the original matrix A. 


Be careful not to confuse the two different concepts square matrix and 
square of a matriz. 


If A is a square matrix, then since A? is a square matrix of the same size 
as A, we can also form the products (A?)A and A(A?). Since matrix 
multiplication is associative, we know that 


A7?A = (AA)A = A(AA) = AA’. 


Hence either of these products can be written unambiguously as AAA. 
This matrix is called the cube of A and is denoted by AÌ. For example, 
for the matrix 


"i 


discussed at the beginning of this subsection, we have 


4. 20, TARN 207 
asata fa s) (a o) = Ca o) 
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In general, for any square matrix A, the nth power of A is 


A" =AA..-A. 
a 


n times 


Activity 12 Calculating matrix powers 


(a) Calculate the squares of the following matrices: 


2 1 


(b) Let A = & i 


) . Calculate Aĉ. 


Hint: the cube A® of this matrix A was calculated before the activity. 


Matrix multiplication with a computer 


Matrix multiplication can be time-consuming to carry out when the 
matrices involved are large. In the next activity you can find out how to 
use a computer to carry out matrix operations, including multiplication. 
This work will increase your familiarity with the way matrices behave, and 
help you distinguish between circumstances when it is worth using a 
computer to perform matrix calculations and when it is more efficient to 
do the calculations by hand. 


Activity 13 Working with matrices on a computer 


Work through Subsection 10.1 of the Computer algebra guide. 


2 Matrices and networks 


Matrices can be used to represent and analyse various flows in networks. A 
network is a collection of objects connected by links — either physical or 
abstract — such as cities connected by roads, telephone exchange points 
connected by telephone lines, or people connected by acquaintance. A 
foremost example is the internet, which is a network of communicating 
computers linked by a range of technologies. In this section you'll look at 
flows in networks of a particular type, and learn how to use matrices to 
work with them. 


2.1 Networks 


Figure 3 shows a simple example of a network of water pipes. Water can 
be poured into the system at position A or B, and it then flows down the 
pipes to come out at positions U, V and W. The pipes have different 
capacities: the three pipes that come out of position A are labelled with 
numbers that indicate the proportions of water that flow through these 
pipes when water is poured in at A, and the three pipes that come out of 
position B are labelled in the same way. For example, if a litre of water is 
poured in at position A, then 0.4 litres flows down the first pipe from A, 
0.2 litres flows down the second pipe from A, and 0.4 litres flows down the 
third pipe from A. 


U 
w Œ ‘te 


Figure 3 A network of water pipes 


A network like this can be represented more simply as a network 
diagram, which is a mathematical representation of a physical network. 
The network diagram in Figure 4 represents the network in Figure 3. 


Figure 4 A network diagram representing the pipe system in Figure 3 


The dots in a network diagram are called nodes, and the lines are called 
arcs. The arrows on the arcs in Figure 4 are included because this network 
diagram represents a network in which flow along an arc takes place in 
only one direction. When no confusion arises, we use the word ‘network’ to 
mean either ‘network diagram’ or ‘physical network’. We’ll use network 
diagrams to represent networks from now on in this section. 
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Example 2 Calculating network outputs 


The network diagram in Figure 4 is repeated below. 


How much water is output at each of nodes U, V and W if the only 
input is 1 litre of water at node B? 


Solution 


@. The label on the arc joining node B to node U indicates the 
proportion of the water input at node B that reaches node U. © 


The label on the arc from node B to node U is 0.3. So 0.3 litres of 
water is output at node U when 1 litre is input at node B. 


@. Read the labels on the other arcs from node B. & 


Similarly, 0.1 litres of water is output at node V, and 0.6 litres of 
water is output at node W. 


Now consider what happens if water is input at both node A and node B 
in the network in Example 2. The amount of water reaching an output 
node is the sum of the water reaching the node from node A and from 
node B. For example, if 1 litre of water is input at node A and another 

1 litre is input at node B, then node U will collect 0.4 litres of water from 
node A and 0.3 litres from node B. So, in total, 0.7 litres of water will be 
output at node U. Similarly, 0.2 + 0.1 = 0.3 litres of water will be output 
at node V, and 0.4 + 0.6 = 1 litre of water will be output at node W. 


In general, if x litres of water is input at node A and y litres of water is 
input at node B, then 


0.42 + 0.3y litres of water is output at node U, 
0.2% + 0.1y litres of water is output at node V, 


0.4% + 0.6y litres of water is output at node W. 


In the next activity you can practise calculating network outputs yourself. 
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Activity 14 Calculating network outputs 


The following network diagram represents a network of water pipes. 


U V W 


(a) How much water is output at nodes C and D if 1 litre of water is 
input at node U, 2 litres of water is input at node V, and there is no 
input at node W? 


(b) How much water is output at nodes C and D if x litres of water are 
input at node U, y litres of water is input at node V, and z litres of 
water is input at node W ? 


Network diagrams like those that you’ve seen in this subsection can be 
used to model many other kinds of flows, such as the movement of money 
in and out of companies. For example, consider three corporate donors, D, 
E and F, say, that every month make donations to two charities, J and K. 
Donor D donates 20% of its monthly designated sum to charity J and 80% 
to charity K, donor E donates 30% of its sum to charity J and 70% to 
charity K, and donor F donates 50% of its sum to charity J and 50% to 
charity K. 


The flow of money from the three donors to the two charities can be 
represented by the network in Figure 5. For example, a total donation of 
£1000 from donor D in a given month is modelled by an input of £1000 at 
node D. The label of the arc connecting D to J tells you that 0.2, or 20%, 
of that sum, which is £200, reaches the output node J. 


D E F 


Figure 5 A network diagram representing the flow of money from three 
corporate donors to two charities 
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Activity 15 Calculating more network outputs 


The network diagram in Figure 5 is repeated below. 


D E F 


It represents the flow of money from donors D, E and F to charities J 
and K, as described above. How much money is output at each of nodes J 
and K in each of the following situations? 


(a) The only input is £2000 at node D. 


(b) The input at node D is £2400, the input at node F is £1700, and the 
input at node F is £1100. 


(c) The input (in £) at node D is x, the input (in £) at node E is y, and 
the input (in £) at node F is z. 


2.2 From networks to matrices 


In this subsection you’ll see how network output calculations of the kinds 
that you saw in Subsection 2.1 can be expressed succinctly using matrices. 


Consider again the first network representing water pipes that you saw in 
Subsection 2.1. It is repeated in Figure 6. 


Figure 6 A network representing water pipes 


The amounts of water (in litres) input at nodes A and B can be 
represented as a two-dimensional vector (a 2 x 1 matrix), whose first and 


242 


second components are the inputs at A and B, respectively. For example, 
the vector 


() 


represents 0 litres of water input at node A and 1 litre of water input at 
node B. 


The outputs from the network can also be represented by a vector. For 
example, the vector 


0.4 
0.2 
0.4 


represents an output of 0.4 litres of water at node U, 0.2 litres at node V, 
and 0.4 litres at node W. 


The network itself can be represented by the matrix 


0.4 0.3 
0.2 0.1 
0.4 0.6 


Here the elements in each column are the labels of the arcs from a node 
(A or B), and the elements in each row are the labels of the arcs to a node 
(U, V or W), as shown below: 


A B 
U /04 0.3 
V {02 0.1 
W \0.4 0.6 


For example, the element in the column labelled B and the row labelled U 
is the label of the arc from node B to node U. You'll see the reason for 
arranging the arc labels in this way in a moment. 


You saw in Subsection 2.1 that if x litres of water is input at node A and 
y litres of water is input at node B, then 


0.47 + 0.3y litres of water is output at node U, 
0.27 + 0.1y litres of water is output at node V, 


0.4x + 0.6y litres of water is output at node W. 


In other words, 


s 0.4g + 0.3y 
if the input vector is ( ) , then the output vector is | 0.22 + 0.ly 
Y 0.4x + 0.6y 


This output vector can be expressed as the product of the matrix that 
represents the network and the input vector: 


0.42 + 0.3y 0.4 0.3 


0.2z +0.1y | = {02 0.1 al 
0.4x + 0.6y 0.4 0.6) Y 


2 Matrices and networks 


243 


Unit 9 Matrices 


t think those water 
engineers may have used 


244 


So we can say that 


0.4 0.3 
if the input vector is z , then the output vector is | 0.2 0.1 (*) : 
y y 
0.4 0.6 
So, for any input vector, you can work out the corresponding output vector 


by multiplying the input vector by the matrix that represents the network. 
This is illustrated in the next example. 


Example 3 Using matrices to calculate network outputs 


Use the matrix that represents the network diagram in Figure 6 to 
calculate the outputs at nodes U, V and W if 2 litres of water is 
input at node A and there is no input at node B. 


Solution 


@. The matrix representing the network was found before this 
example. ® 


The matrix representing the network is 


0A Ors 
02 OW. 
0.4 0.6 


The input vector (in litres) is 


2 
0): 
@. The output vector is the product of the matrix representing the 


network and the input vector. ® 


Hence the output vector is 


0.4 0.3 9 0.4 x 2+0.3 x 0 0.8 
OLA 0i (o) = | 02x 240.1 xO || = || O4 
0.4 0.6 0.4 x 2+0.6 x 0 0.8 


So the outputs at nodes U, V and W are 0.8 litres, 0.4 litres and 
0.8 litres, respectively. 


In general, to represent a network by a matrix in the way described above, 
you need a column for each input node and a row for each output node. So 
if the network has n input nodes and m output nodes, then the matrix has 
size M X n. 


When you find a matrix representing a network of the type considered in 
this section, it’s useful to check that the elements in each column have 
sum 1. The matrix must have this property because the elements in each 
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column are the labels on an arc from a particular node. You can see that 
the matrix representing the network in Example 3 has this property. 


Activity 16 Using matrices to calculate network outputs 


The network in Activity 14, which represents a network of water pipes, is 
shown again below. 


(a) Write down the matrix that represents this network. 


(b) Write down a vector that represents an input of 1 litre of water at each 
of nodes U, V and W. 


(c) Use your answer to part (a) to calculate the output vector for the 
input vector in part (b), and interpret your answer in terms of the 
amounts of water output at nodes C and D. 


2.3 Combining networks 


Networks can be combined by making the outputs from one network 
become the inputs to another network. In this section you’ll see how to use 
matrices to calculate the outputs from a combined network using the 
information about the original networks. 


The network diagram in Figure 4 and the one in Activity 14 are repeated 
in Figure 7. Both diagrams represent networks of water pipes. 


Figure 7 Two network diagrams 
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The first network has three output nodes, and the second network has 
three input nodes. We can join these nodes so that, for instance, the water 
output at U in the first network flows down the pipes that leave U in the 
second network. The combined network is shown in Figure 8. 


Figure 8 The combined network 


The network obtained in this way can be considered as equivalent to a 
simpler network with arcs going directly from the input nodes A and B to 
the output nodes C and D, as illustrated in Figure 9. In general, we 
regard two networks of the type considered in this section as equivalent if 
they have the same input and output nodes, and all choices of inputs 
produce the same outputs in both networks. 


A B 


C D 


Figure 9 A network equivalent to the one in Figure 8 

How can we find the labels for the arcs in the new, simpler network 
diagram? Here’s how you can find the label for the arc from A to C, for 
example. 

Suppose that 1 litre of water is input at node A in the combined network 


diagram, and there are no other inputs. Then, from the arc labels in the 
top half of the diagram, you know that the amounts of water passing 


through nodes U, V and W are 0.4, 0.2 and 0.4 litres, respectively. So, 
from the arc labels in the bottom half of the diagram, you can work out 
the following. 


e Of the 0.4 litres of water that passes through node U, the amount that 
reaches node C is 0.2 x 0.4 = 0.08 litres. 


e Of the 0.2 litres of water that passes through node V, the amount that 
reaches node C is 0.6 x 0.2 = 0.12 litres. 


e Of the 0.4 litres of water that passes through node W, the amount 
that reaches node C is 0.3 x 0.4 = 0.12 litres. 


To find the total amount of water output at node C, we add the 
contributions from the three different routes, which gives 

0.08 + 0.12 + 0.12 = 0.32 litres. So the proportion of the water input at 
node A that’s output at node C is 0.32, and hence the arc from A to C 
should be labelled with the number 0.32. 


You can find the labels for the remaining arcs in a similar way. However, 
these calculations are quite tedious, and it turns out that they can be 
simplified by using the matrices that represent the two original networks, 
which are 


ie 0.2 0.6 0.3 
0.2 0.1 and 08 04 07)? 
0.4 0.6 i i i 


respectively. These matrices were found in Subsection 2.2. 


Suppose that the input and output vectors (in litres) for the combined 
network in Figure 8 are 


9 e (): 


respectively. Suppose also that the output vector (in litres) of the top part 
of the network, which is also the input vector for the bottom part, is 


u 
U 
w 
Then 
an 0.4 0.3 (:) na eat 0.6 4 p 
2 PONG d)~\o8 04 0.7) |? 


Using the first of these equations to substitute in the second gives 


eG. 0.6 a] < (a) 
d 0.8 0.4 0.7 na He b 


You know from Section 1 that matrix multiplication is associative, so 


eG e 0.6 i Me a Gr 
d 0.8 04 07) \ on og b 
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that is, 


Eri! (2) 


The conclusion of this discussion is that the matrix that represents the 
combined network in Figure 9 is the product of the matrices that represent 
the original networks. So matrix multiplication provides a neat way to 
calculate outputs of the combined network. 


Notice that the matrix representing the top network is the second matrix 
in the product that represents the combined network. 


The labels of the simpler network that is equivalent to the combined 
network can be read off the product matrix. This is easier if we label the 
rows and the columns with the nodes as below: 


A B 
C (0.32 0.3 
D \0.68 0.7) ° 
Notice that each column of this matrix has sum 1, as expected in view of 


the comment after Example 3. 


The original combined network and the equivalent simpler network, with 
the arc labels, are shown in Figure 10. 


0.68 
0.32 0.7 


Figure 10 The two equivalent networks 


Activity 17 Finding the matrix representing a combined network 


The network below is the combination of two smaller networks, one with 
input nodes U and V and output nodes P and Q, and the other with input 
nodes P and Q and output nodes A, B, and C. 


(a) 


Write down the matrix equation that gives the output vector (c ) at 
nodes P and Q when the input vector is (‘) at nodes U and V. 


a 
Write down the matrix equation that gives the output vector | b | at 


nodes A, B and C when the input vector is C ) at nodes P and Q. 


Use your answers to parts (a) and (b) to find the matrix equation for 
the combined network when the input vector is @ at nodes U 


and V. 


Draw a network with input nodes U and V and output nodes A, B 
and C that is equivalent to the given network. 


Suppose that the given network represents a network of water pipes. 
Determine the outputs at nodes A, B and C when 1 litre of water is 
input at U and 2 litres of water is input at V. 


Joining networks is also useful in other contexts. For example, Figure 11 
repeats the network diagram from Figure 5, which represents the flow of 
money from three corporate donors D, E and F to two charities J and K. 
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Figure 11 A network diagram representing the flow of money from three 
corporate donors to two charities 


Suppose that donors D, E and F are the only donors to the two charities, 
and that the two charities spend their money in two different geographical 
regions, R and S, as indicated in Figure 12. So, for example, charity J 
spends 60% of its money in region R and 40% in region S. 


J K 


0.6 0.1 


R S 


Figure 12 A network diagram representing the flow of money from two 
charities into two regions 


In the next activity you’re asked to combine the networks in Figures 11 
and 12 to obtain a network that represents the flow of money from the 
three corporate donors to the two regions. 


Activity 18 Finding the matrix representing another combined 
network 


(a) Write down the matrices that represent the networks in Figures 11 
and 12. 


(b) Hence find the matrix that represents the network obtained by 
combining these two networks. 


(c) Draw a simplified network that represents the combined network 
described in part (b). 


(d) If, in a particular month, donor D donates £1700, donor E donates 
£1400 and donor F donates nothing, how much money is spent in 
region S? 
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In a real-life situation, networks like the one in Activity 18 would have 
many more input and output nodes, and the matrices representing them 
would be manipulated on a computer. 


3 The inverse of a matrix 


In Section 1 you met four matrix operations: addition, multiplication by a 
scalar, subtraction and matrix multiplication. No division was mentioned: 
it is not possible to divide a matrix by another matrix. However there is a 
way to manipulate matrices that in certain situations plays the role of 
division. To understand this method, you need to know about special 
matrices called identity matrices, which are introduced next. 


3.1 Identity matrices 


The number 1 has a special property among real numbers: multiplication 
by 1 leaves any real number unchanged. An identity matrix is a matrix 
that behaves like the number 1, in the sense that if a matrix A is 
multiplied by an identity matrix of an appropriate size then the result is 
again A. You will meet an identity matrix in the next activity. 


Activity 19 Multiplying by an identity matrix 


3 4 2 7 -3 1 0 
Let A = E al B= e 1 >) and I= (o a Evaluate each of 


the following products, where possible: 


AI, IA, BI, IB. 


The matrix I in Activity 19 has the following two properties. 

e If A is any matrix such that the product AI exists, then AI = A. 
e If A is any matrix such that the product IA exists, then IA = A. 
Any matrix I with these two properties is called an identity matrix. 


For a matrix to be an identity matrix, it has to be square. To see this, 
suppose that A is an m x n matrix. Then in order for the product AI to 
be defined, I must have n rows. Moreover, in order for AI to have size 
m x n (which it must if it is to be equal to A), I must have n columns: 


mx{n | times [n]xn gives m xn. 


So I must have size n x n. 


3 The inverse of a matrix 
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It’s straightforward to verify that the following 2 x 2, 3 x 3 and 4 x 4 
matrices are identity matrices: 


1 0 0 0 
1 0 Ae 0 1 0 0 
O17’ ooa) 19 8 1 O 
0 0 0 1 


In fact, for every natural number n, the n x n matrix that has ones down 
the leading diagonal (the diagonal that starts at the top left element and 
ends at the bottom right element), and zeros everywhere else, is an 
identity matrix. 


Furthermore, for every natural number n, this matrix is the only n x n 
identity matrix. To see this, let I be the n x n identity matrix described 
above, and let E be an n x n identity matrix. Then IE = I and IE = E, 
so I= E. 


When we’re working with matrices, we usually reserve the letter I to 
denote the n x n identity matrix. 


Activity 20 Multiplying by an identity matrix 


x 


: : > and B = | y | be general matrices, and let I be the 
z 


3 x 3 identity matrix. Verify that AI = A and IB = B. 


Let A = ( 


The facts that you’ve seen about identity matrices are summarised in the 
following box. 


Identity matrices 

An identity matrix is a square matrix I such that 

e for any matrix A for which the product AI is defined, AI = A 
e for any matrix A for which the product IA is defined, IA = A. 


Each identity matrix has the form 


l O @ sce © 
© i Q ove @ 
© O L soo @ 
© © WO ss 1 


That is, it has ones down the leading diagonal and zeros elsewhere. 


3 The inverse of a matrix 


3.2 Matrix inverses 


As mentioned earlier, it isn’t possible to divide matrices, but it is 
sometimes possible to perform a manipulation with a similar effect. 


Remember that, in the arithmetic of numbers, dividing by a number is the 
same as multiplying by its reciprocal. For example, dividing by 2 is the 
same as multiplying by 5. The reciprocal of a non-zero number a is the 
number b such that ab = 1. 


There is a notion in matrix arithmetic that is analogous to the notion of a 
reciprocal in ordinary arithmetic. Let A be a square matrix. If there is 
another matrix B of the same size with the property that 


AB=I and BA =I, 


where I is an identity matrix, then we say that A is invertible and that B 
is an inverse of A. Multiplying by an inverse of a matrix is useful in many 
situations where you might want to use a matrix operation analogous to 
division. 


Activity 21 Checking matrix inverses t thought we told us 
to practise Luverting 
For each of the following pairs of matrices A and B, check that B is an mattresses! 


inverse of A, by finding the products AB and BA. 


2 3 1 © 1 41 
(b) A={-1 1 0], B=[0 2 -1 
=F 1 0 1 -8 5 


If a matrix A is invertible, then it has exactly one inverse. To see this, 
suppose that the matrices B and C are both inverses of the matrix A. 
Then 


AB=I and BA =I, andalso AC=TI and CA=lI. 

Now consider the matrix BAC. Since matrix multiplication is associative, 
BAC = (BA)C=IC=C, 

and also 
BAC = B(AC) = BI=B. 

Hence B and C are the same matrix. That is, A has only one inverse. 


We usually write the inverse of an invertible matrix A as A~!. So the 
following facts hold. 
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Inverse of a matrix 
If A is an invertible matrix, then 
AA =I and A™HA =I, 


where I is the identity matrix of the same size as A. 


It follows from the definition of the inverse of a matrix that if A is an 
invertible matrix, then not only is A~! the inverse of A, but also A is the 
inverse of A~!. In other words, A and A! are inverses of each other. 


In the rest of this subsection, you'll learn how to establish whether any 
particular 2 x 2 matrix has an inverse, and how to work out the inverse 
when it exists. You’ll also use a computer to calculate inverses of larger 
matrices. 


Inverses of 2 x 2 matrices 


For 2 x 2 matrices, there is a useful formula for finding inverses. 


Inverse of a 2 x 2 matrix 


The inverse of the 2 x 2 matrix ( i) is given by 


ad — bc \—c a 


In other words, to obtain the inverse when ad — bc 4 0, swap the two 
elements on the leading diagonal and multiply the other two elements 
by —1, then multiply the resulting matrix by the scalar 1/(ad — bc). 


: ( 7 z) , provided that ad — bc Æ 0. 


You'll be asked to verify this general formula later in this subsection, but 
for now let’s check that it works for the matrix 


a= (53) 


in Activity 21(a). Here ad — be = 8 x 2 — 5 x 3 = 1 Æ 0, so the scalar 
1/(ad — bc) is 1. You can see that the matrix 


B= (5) 


in Activity 21 (a) is indeed obtained from A by swapping 8 and 2, 
multiplying the other two elements by —1, and multiplying the resulting 
matrix by the scalar 1. 


Notice that the box above includes the condition ad — bc Æ 0. This is 
because the formula involves dividing by ad — bc, and division by 0 is not 
defined. If the elements of a 2 x 2 matrix do not satisfy this condition, 


then the matrix does not have an inverse. You’ll see a proof of this fact in 
the next section. 


In fact the quantity ad — bc that appears in the formula is so important 
that it deserves a name, as follows. 


Determinant of a 2 x 2 matrix 


Let A = ‘ : . Then the number ad — bc is called the 


determinant of A, and written as det A. 
If the determinant of A is not zero, then A is invertible. 


If the determinant of A is zero, then A is not invertible. 


Some texts denote det A by |A|, which looks like the notation for the 

modulus of a real number, but has a different meaning in this context. 

This notation is also used when a matrix is written out in full; for example, 
| 2 1 


3 0 =2 x 0—1 x (-3). 


When seeking the inverse of a 2 x 2 matrix, you should first check that the 
matrix is invertible by calculating the determinant. If the determinant is 
not zero, then you can use the rule for finding the inverse. 


3 The inverse of a matrix 
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You can practise finding the inverses of 2 x 2 matrices in the next activity. 


Activity 22 Finding inverses of 2 x 2 matrices 


Determine which of the following matrices are invertible. For each matrix 
that is invertible, write down its inverse. 


(a) A= E >) Op e >) 


-3 į 1.5 25 
(©) c= ( 5 >) go E is) 


In the next activity you’re asked to verify that the formula for the inverse 
of a 2 x 2 matrix works in general. 


4 Simultaneous linear equations and matrices 


Activity 23 Verifying the formula for the inverse of a 2 x 2 matrix 
a b ; 1 d —b 
Let A= (" I) with ad~be# 0, and B=- ( ’). 
Work out the products AB and BA, and deduce that B = A7!. 


Hint: use the fact that P(kQ) = k(PQ) for any matrices P and Q for 
which the product PQ is defined and for any scalar k. 


A matrix that does not have an inverse is called a non-invertible matrix. 
In some texts, non-invertible matrices are called singular matrices, and 
invertible matrices are called non-singular matrices. 


Determinants and inverses of larger matrices 


You have seen that a 2 x 2 matrix has an inverse only if its determinant is 
not zero. 


The determinant is defined for square matrices of any size, and it plays the 
same role as in the 2 x 2 case: any square matrix A such that det A 4 0 
has an inverse A~!. If det A = 0, then no inverse can be found. 


It is possible to use hand calculations to work out the determinant and the 
inverse (when it exists) of a square matrix whose size is larger than 2 x 2, 
but the process is rather complicated, and beyond the scope of this 
module. However, you can use the computer algebra system to calculate 
determinants and inverses of larger matrices. 


Activity 24 Working with matrix inverses on a computer 


Work through Subsection 10.2 of the Computer algebra guide. 


4 Simultaneous linear equations and 
matrices 


In Section 3 of Unit 2, you met simultaneous linear equations and you saw 
two methods (substitution and elimination) for solving them. Here you'll 
meet a third method, which uses matrices. 
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4.1 Solving simultaneous linear equations in two 
unknowns 


Consider the following simultaneous linear equations in the two 
unknowns x and y: 


0.4% + 0.2y = 4 


3 
0.6 + 0.8y = 11. (3) 


There’s a useful way to write equations (3) as a single equation involving 
two-dimensional vectors (which are 2 x 1 matrices). To do this, you write 
the sides of each equation as the components of a vector, as follows: 


0.44+0.2y\ (4 
0.67 +0.8y)/  \11/)° 


You can now write the left-hand side of this equation as the product of a 
2 x 2 matrix and a two-dimensional vector: 


(08 oa) G) = (ih). 


The 2 x 2 matrix that appears in this equation is called the coefficient 
matriz of the simultaneous linear equations (3). Its elements are the 
numbers that appear on the left-hand sides of equations (3). Any pair of 
simultaneous linear equations in two unknowns can be written in this 
form. 


Matrix form of simultaneous linear equations 
The simultaneous linear equations 

ax + by =e 

cx + dy = f 


can be written as the matrix equation 


eaa G)=@)- 


The matrix (: J is called the coefficient matrix. 


d 


Note that the coefficient matrix of simultaneous linear equations depends 
on how the equations are written. For example, if you write equations (3) 
as 


0.6x + 0.8y = 11 
0.42 + 0.2y = 4, 


then you obtain the coefficient matrix 

0.6 0.8 

0.4 0.2)’ 
which is different from the coefficient matrix obtained above for the same 
equations. 


4 Simultaneous linear equations and matrices 


Example 5 Writing simultaneous linear equations in matrix form CS) 
Write the following pair of equations in matrix form. — 
y- al 
W=? 
Solution 


@. Rearrange each equation so that the unknowns are on the 
left-hand side and in the same order in each equation, and the 
constant terms are on the right-hand sides. © 


The equations can be rewritten as follows: 
2z + (-1)y=1 
(Op == Sy) = 2. 
@. Now read off the matrix form from the equations. ©& 


Therefore the equations can be written in matrix form as 


(0-0) 


There are other possible solutions to Example 5. For example, the 
equations can be rewritten in the form 


—2¢+ y=-1 
Ox + 3y = 2, 


which gives the matrix form 
-2 1\ (2) fai 
0 3) \yy \ 2° 
In the next activity you can practise finding matrix forms of pairs of 
simultaneous linear equations. 
Activity 25 Writing simultaneous linear equations in matrix form 


Write each of the following pairs of equations in matrix form. 


(a) 22+ 3y =3 (b) 22-6 =0 (c) 2x — $y = 18 
zx + 4y = —1 3x + 6y = 15 tr + y= -1 
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The matrix form of a pair of simultaneous linear equations is not just 
convenient shorthand: it is the initial step in a method for solving the 
equations. Let’s see how this method works for equations (3) from the 
beginning of this subsection, which have the matrix form 


(05 oa) G) = (a): 


Let A be the coefficient matrix; that is, 
0.4 0.2 
A= o i : 


Then the matrix form of the equations can be written as 


D (3) 7 a) 


Since det A = 0.4 x 0.8 — 0.2 x 0.6 = 0.2, the matrix A has an inverse AW}. 
If we multiply both sides of the equation above by A~!, then the result is 


a =a (a) 


By the associativity of matrix multiplication, this equation can be 
rearranged as: 


win (a8) 


Since A~'A =I, where I is the 2 x 2 identity matrix, the equation above 
can be rewritten as 


G)-* (8) 


Since I is an identity matrix, our equation becomes 


m ° 


Therefore, if we work out A~! and calculate the product on the right-hand 
side of this matrix equation, then we’ll find a solution to the original 
equations. Now 


Ai if 08 —0.2\_/ 4 -1 
0.2 \—0.6 04 =f 2J- 
So substituting for A~! in equation (4) gives 
c\ | 4 -1 4\ /5 
y) \-3 2/\11/  \l0/° 


In other words, 


x=5 and y=10. 


4 Simultaneous linear equations and matrices 


The example above illustrates a general method for solving simultaneous 
linear equations, which can be summarised as follows. 


Strategy: 
To solve a pair of simultaneous linear equations in two 
unknowns using matrices 


Write the simultaneous linear equations in matrix form Ax = b, 
where A is the coefficient matrix, x is the corresponding vector of 
unknowns, and b is the vector whose components are the 
corresponding right-hand sides of the equations. 


If the matrix A is invertible, then the solution is given by 


x= AİD. 
Example 6 Using matrices to solve simultaneous linear equations A 
Use matrices to solve the following pair of simultaneous linear = 
equations. 

zx — 5y = —3 

z+ 3y = 13 
Solution 


@. Find the matrix form of the equations. © 


The matrix form of the equations is 


G3) G) = Gs) 
1 BH AW lgj 
@. Find the inverse of the coefficient matrix. © 
The determinant of the coefficient matrix is 
ix 3 (45) x18, 


so the inverse of the coefficient matrix is 


eee 


Cole olw 
QolF Color 
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As usual, you can check the answer to Example 6 by substituting back into 
the original equations. 


You can practise the method in Example 6 in the next activity. 


Activity 26 Using matrices to solve simultaneous linear equations 


Use matrices to solve the following pairs of simultaneous linear equations. 


(You were asked to write these pairs of equations in matrix form in 
Activity 25.) 


a) 22+ 3y=3 b) 2a-—6 =0 c) 24—4y = 18 
5 5 
xz + 4y = -1 3x + 6y = 15 tn +2y=~1 


4.2 Non-invertible coefficient matrices 


You saw in Unit 2 that a pair of simultaneous linear equations in two 
unknowns can have no solutions, exactly one solution or infinitely many 
solutions. If it has exactly one solution, then we say that it has a unique 
solution. 


In this subsection you'll see that a simple way to tell whether a pair of 
simultaneous linear equations in two unknowns has a unique solution is to 
work out the determinant of its coefficient matrix. 


If the determinant is non-zero, then the coefficient matrix has an inverse 
given by the formula that you saw in Subsection 3.2, and hence the 
equations have a unique solution, by what you saw in Subsection 4.1. 


Let’s now look at what happens when the determinant is zero. Let’s start 
by looking at two examples. 


4 Simultaneous linear equations and matrices 


First, consider the following pair of simultaneous equations: 
2x — 3y = 5 
—4z + 6y = 7. 


The coefficient matrix is 


(a) 


which has determinant 2 x 6 — (—3) x (—4) = 0. Let’s look for a solution 
to the equations using the elimination method. If we multiply both sides of 
the first one of equations (5) by —2, then we get 

—Azx + 6y = —10 

—4¢+6y= 7. 
These equations have identical left-hand sides, but different right-hand 


sides. So they have no solution: no values of x and y can make —4g + 6y 
equal to both —10 and 7. 


Now consider the following pair of simultaneous equations: 
x+2y = —6 
—3x — by = 18. 


The coefficient matrix is 


(3-4); 


which has determinant 1 x (—6) — 2 x (—3) = 0. Again, let’s look for a 
solution to the equations using the elimination method. If we multiply 
both sides of the first of equations (6) by —3, then we get 


—3x = 6y = 18 
—3x — by = 18. 


These equations are identical. So they have infinitely many solutions: all 
values of x and y that satisfy the first equation also satisfy the second 
equation. 


You’ve now seen two cases where the coefficient matrix has determinant 
zero. In the first case, the simultaneous equations had no solution. In the 
second case, they had infinitely many solutions. In fact, whenever the 
coefficient matrix of a pair of simultaneous linear equations has 
determinant zero, one of these two possibilities must occur. 


To see why, consider any two simultaneous linear equations 
ax + by =e 
ca + dy = f, 


and suppose that the determinant of the coefficient matrix is zero; that is, 
ad — bc = 0. We can assume that in each equation at least one of the 
coefficients is non-zero; otherwise, the equation is of the form Ox + 0y = 
constant, so the pair of equations has either no solutions or infinitely many 
solutions. 
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Suppose that in the first equation a Æ 0; a similar argument holds if b Æ 0. 
Since ad — bc = 0, we have d = bc/a. Hence c +Æ 0, since if c = 0 then d= 0 
also. Multiplying the first equation by c/a gives 


cx + (be/a)y = ce/a 
cz + dy = f. 
Since d = bc/a, these two equations can be rewritten as 


cx + dy = ce/a 


cz + dy = fÍ. 


These equations have identical left-hand sides. So they (and hence the 
original equations) have either no solution (if ce/a 4 f) or infinitely many 
solutions (if ce/a = f). 


So, in conclusion, if the coefficient matrix has determinant zero, then the 
equations have either no solution or infinitely many solutions. 


We can deduce from this fact that if a 2 x 2 matrix has determinant zero, 
then it is not invertible. This is a fact that was mentioned in 

Subsection 3.2. To see how, consider any 2 x 2 matrix with determinant 
zero. If the matrix were invertible, then any pair of simultaneous equations 
with this matrix as its coefficient matrix would have a unique solution, 
which cannot be the case, since you saw above that if the determinant of 
the coefficient matrix is zero, then the equations have no solution or 
infinitely many solutions. 


So you have learned the following important payoffs from the matrix 
method for simultaneous linear equations. 


For a pair of simultaneous linear equations in two unknowns: 


e if the determinant of the coefficient matrix is non-zero, then this 
matrix is invertible and the equations have a unique solution 

e if the determinant of the coefficient matrix is zero, then this 
matrix is not invertible and the equations have no solution or 
infinitely many solutions. 


Thus, calculating the determinant of the coefficient matrix gives a quick 
test for the existence of a unique solution. 


Activity 27 Determining whether simultaneous equations have 
unique solutions 


Which of the following pairs of simultaneous linear equations has a unique 
solution? (You are not asked to find the solution.) 
(a) 5¢—3y=5 (b) 32- y= -6 

-r + 4y =7 —9x + 3y = 18 


4 Simultaneous linear equations and matrices 


4.3 Simultaneous linear equations in more than two 
unknowns 
The matrix method that you met in Subsection 4.2 can be used to find a 


solution for more than two simultaneous equations in more than two 
unknowns. 


In Subsection 5.3 of Unit 1, you learned that a linear equation is one in 
which, after you’ve expanded any brackets and cleared any fractions, each 
term is either a constant term or a number times a variable. For example, 
the equation 

e+2y-z=3 


is linear, but none of the equations 


at+y?=2, r+yz=1 and r+% =1 


are linear. In this subsection we’ll usually write a linear equation in a form 
similar to the linear equation above, that is, with its left-hand side as a 
sum of terms, each of which is a number times a variable (and with each 
variable appearing just once), and with the right-hand side as a number. 


We’ll refer to a collection of linear equations in a given set of unknowns as 
a system of linear equations. For example, here’s a system of three 
linear equations in three unknowns, x, y and z: 


2x + 2y +32 =2 
2z + 3y + 4z = —1 
4z + Ty + 10z = 3. 


A solution to a system of linear equations is an assignment of values to 
the unknowns that makes all the equations hold simultaneously. As in 
Subsection 3.1 of Unit 2, the process of finding these values is known as 
solving the equations simultaneously. We’ll look only at systems where the 
number of unknowns is equal to the number of equations. 


Numerical problems that can be modelled by systems of linear equations 
often arise in applications. Consider, for example, a bank that offers three 
types of investment to its customers: a financial product A that pays 
guaranteed interest of 2.5% over two years and is quite safe, and two 
financial products B and C that in the same time period are likely to yield 
interests of 5% and 6% respectively, but are more risky. 


Suppose that a customer is seeking to invest £9000 over two years, and is 
advised to spread his investment across the three types of financial 
products. He is risk-averse, so he wants to invest a total of only £4000 
across the two riskier products B and C. Of this £4000, he wants to invest 
just enough in the higher-yielding product C to ensure that his total 
£9000 investment is likely to yield an overall return of 4%, with the 
remainder of the £4000 invested in the lower-yielding product B, to spread 
his risks. How should he spread his investment across the three products in 
order to achieve his desired return? 
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First of all, the amounts invested in each product must add up to £9000. 
So, if x, y and z represent the amounts invested (in £) in products A, B 
and C, respectively, then 


£ +y + z = 9000. 


Moreover, the investor wants to invest a total of £4000 in products B 
and C, so 


y + z = 4000. 


Since the interest on his investment in product A is 2.5%, the investor’s 
expected interest payment from his investment in product A is 0.0252. 
Likewise, his expected interest payments from his investments in products 
B and C are 0.05y and 0.06z, respectively. The investor is seeking an 
overall return of 4%, that is, an overall interest payment of 

0.04 x £9000 = £360. Thus the amounts invested in the three products 
must also satisfy the equation 


0.0252 + 0.05y + 0.062 = 360. 


Therefore, the amounts x, y and z must simultaneously satisfy the three 
linear equations 


z+ yt  2=9000 
y+ z= 4000 (7) 
0.0252 + 0.05y + 0.062 = 360. 


Such systems of linear equations can be solved efficiently by methods 
similar to elimination and substitution — you can probably see that 
equations (7) can be solved fairly easily for x, y and z by 


e first subtracting the second equation from the first to give x = 5000 


e then substituting this value for x into the third equation and solving 
the resulting two simultaneous equations for y and z. 


Methods of this sort are covered in detail in other modules. Systems of 
linear equations like those above can also be solved by using matrices, and 
the rest of this subsection covers this approach. 


Any system of n linear equations in n unknowns can be written as a single 
matrix equation 


Ax = b, 


where A is an n x n coefficient matrix, x is an n-dimensional vector whose 
components are the unknowns, and b is an n-dimensional vector, as you 
saw for the case n = 2 in Subsection 4.2. If the coefficient matrix A is 
invertible, then the matrix equation can be solved by multiplying both 
sides by the inverse of A, giving x = ATtb, just as in the 2 x 2 case. 


In Section 3 you learned how to use a computer to find the inverses of 
matrices of size 3 x 3 and larger. This enables you to use the matrix 
method to solve systems of three or more linear equations. In the next 
example, this method is used to solve equations (7). 


4 Simultaneous linear equations and matrices 


Example 7 Solving a system of three linear equations 


Use the matrix method to solve the following system of three linear 
equations. 


ce yt 2=9000 
y+ z= 4000 
0.0252 + 0.05y + 0.06z = 360 
Solution 


The matrix form of the system of three equations is 


1 i i x 9000 
0 1 1 y | = | 4000 
0.025 0.05 0.06/ \z 360 


@. Use the computer algebra system to find the inverse of the 
coefficient matrix. ® 
The inverse of the coefficient matrix is 
1 —1 0 
2.5 3.5 —100 
—2.5 —2.5 100 


@. Multiply each side of the matrix equation by the inverse of the 
coefficient matrix. ® 


Therefore the solution is given by 


ae 1 = 0 9000 5000 
a || = AD Bs) IOC 4000 | = | 500 
z <2 =25 OO 360 3900 


Note that you could use the computer algebra system to solve the 
simultaneous equations (7) directly, that is, without first finding the 
inverse of the coefficient matrix, and so check the answer to Example 7. 


The solution to equations (7) found in this example tells our investor that 
the best way to spread his investment in order to achieve a 4% return is as 
follows: 


£5000 on investment A 
£500 on investment B 
£3500 on investment C. 


You can practise finding inverses of matrices and using them to solve 
systems of more than two linear equations in the next activity. 
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Solve each of the following systems of simultaneous linear equations by 
using the computer algebra system to find the inverse of the coefficient 


matrix. 

(a) 2z + 2y+3z=2 (b) 3r +2y+z=1 
2x + 3y + 4z = —1 4r +3y+z=2 
4z + Ty +10z=3 Tz +5y+z=1 


In fact using matrix inverses is not usually the most efficient way to solve 
systems of linear equations, but the method has theoretical importance. 


For example, it can be shown that, for all n > 2, 

e ifthe determinant of the coefficient matrix of a system of n equations 
is non-zero, then this matrix is invertible and the equations have a 
unique solution 

e ifthe determinant of the coefficient matrix of a system of n equations 


is zero, then this matrix is not invertible and the equations have no 
solution or infinitely many solutions. 


You saw the case n = 2 of this result in Subsection 4.2. 
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The following activity provides further practice on the significance of the 
determinant of the coefficient matrix for systems of three or more linear 
equations. 


Activity 29 Determining whether systems of linear equations have 
unique solutions 


Which of the following systems of linear equations have a unique solution? 


(a) c+2y+3z=5 (b) 2x + 2y+ 32 =5 
2x+3y+ z=6 2x + 3y z=6 
4¢+7y+7z=1 4x + Ty+7z=1 


(c) e+ y- w+z=0 
t= ytwt+2z=2 
22 +3w-—z=1 
6x + 2y + 4w =0 


Learning outcomes 


After studying this unit, you should be able to: 

e use matrix notation 

e evaluate sums, differences and scalar multiples of matrices 
e establish whether two matrices can be multiplied together 


e evaluate a matrix product both by hand and by using the computer 
algebra system 


e determine whether a given n x n matrix has an inverse 


e work out the determinant and inverse (if it exists) of a 2 x 2 matrix by 
hand 


e use the computer algebra system to work out the determinant and 
inverse (if it exists) of a square matrix 


e write a system of linear equations in matrix form and solve it using 
matrix methods where possible. 


Learning outcomes 


269 


Unit 9 Matrices 


Solutions to activities 


Solution to Activity 1 Solution to Activity 4 
We have a33 = 3, a34 = —4, a43 = —7, . 4 3\ /12 9 
ay4 = —1, a44 = —9, a42 = —5. (a) @) 3 2 1} \6 3 
The matrix A has four rows and four columns, so 


its size is 4 x 4. (ii) 


Solution to Activity 2 


@ A+B=(5 De ae 2 i 
1 


In each of the answers in this part, there is 
= ( Po = >) more than one way to choose the scalar 

multiple. For example, another correct 
solution to part (i) is 


Solution to Activity 3 i 5 3 ) a a 
f = 0.25 ; 


= t 4 3 -6 2 ALS 8 6 
Ga- e e. 
1 0 -1 2 2 2 : = 
Solution to Activity 5 


3 
_ (2 -2 _{ 2 6 We have 
1 0 —2 2 i i 2 —1 
+ 
2 


(b) B- A = 


T 

" 

vs 

l 

Q 

II 

Oe a E a 

l 
= nie e nie 
N A 
NES 

l 


The matrix 2A has size 2 x 3 and B has size 2 x 2; 
hence the sum B + 2A is not defined. 


Finally, 


ETT oe TE 
=- (1 3 5 8 —4 0 
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Solution to Activity 6 


For each pair of matrices in this question, the sizes 


of the matrices are written underneath the matrices, 


and the middle two numbers are boxed in order to 
use the method described in the text. 


2 9 


3x|2] [2|x3 


The middle numbers are the same, so these 
matrices can be multiplied together. Their 


product is 
2 9 sgi 32 6 47 
7 1 ( 40 s) = | -10 2h 12 
—6 8 44 —18 34 


9 
(b) (1 2 3)|8 


1x[3 | [3 ]x1 


The middle numbers are the same, so these 
matrices can be multiplied together. Their 


product is 
9 
(1 2 3){8]} = (46) 
7 
2 9 0 10 
(c) Tal 4 1 
—6 8 —2 7 


s] Ehe 


The middle numbers are different, so these 
matrices cannot be multiplied together. 


9 
(a) [8 


3x[1| [a ]xa 


The middle numbers are the same, so these 
matrices can be multiplied together. Their 


(1 2 3) 


product is 
9 9 18 27 
8| (1 2 3)=|8 16 24 
í 7 14 21 


Solutions to activities 


; gay fo l 
(e) zo ailll oai 
21 3 


axla] [3]x3 


The middle numbers are the same, so these 
matrices can be multiplied together. Their 
product is 


Teel 
TEI a 15 1 4 


Solution to Activity 7 


(a) (i) Here A has size 2 x 2 and B has size 2 x 1. 
For AB, the size check is 


2x2] [2 ]x1. 


The middle numbers are the same, so the 
product AB is defined. 


For BA, the size check is 


ax{ 1] [2 [2 


The middle numbers are different, so the 
product BA is undefined. 


(ii) Here both A and B have size 2 x 2, so the 
products AB and BA are both defined. 


(b) For the matrices in part (a)(i), the product BA 
is not defined. 


G 5 


For the matrices in part (a)(i 


(EDC 
i 


m- ( NC 


Therefore AB 4 BA. 


Solution to Activity 8 


(a) The size of A is 1 x 2 and the size of B is 2 x 2. 
The size check for the product AB is 


1 x2] [2 |x 2: 
Thus AB has size 1 x 2. 


The size of C is 2 x 3. The size check for the 
product BC is 


ax|2| [2 |x 3. 


Thus BC has size 2 x 3. 
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We have 
AB =(-1 0) e ne 1), and 
1 -1 1 3 
BC = (3 Ei E 4 
f4 
“AS 2 


(b) From part (a), the size of AB is 1 x 2. Since C 
has size 2 x 3, the size check for (AB)C is 


1x <3. 
So the product (AB)C exists and has size 1 x 3. 


From part (a), the size of BC is 2 x 3. Since A 
has size 1 x 2, the size check for A(BC) is 


1x x 3. 
So the product A(BC) exists and has size 1 x 3. 


Using the answers for the matrices AB and BC 
found in part (a) we obtain 


(AB)C = (-1 1) (3 i D 


=(-4 1 1) 
and 
A(BC) = {i 0) € E E 
=(-4 1 1). 


We can see that (AB)C = A(BC). 


Solution to Activity 9 


We have 
1 0 2 0 
m=2(> 1)=( 2): 
so 
1 2 2 0 10 4 
agi 4)(G 9)=( a) 
Also, 
1 2 1 0 5 2 
aB= (5 AG E 
so 
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Solution to Activity 10 


We have 
ere ()+(1)-Q) 
woh DO- 
Also, 


1 -l 1 =] 
an= (s a (2)=(u), 
1 -l =] —2 
acm (9 a) (a) (4), 
so 
= —2 —3 
anac = (11) +CD = a) 
Thus A(B + C) = AB + AC, as required. 


Solution to Activity 11 


(a) The product of two matrices A and B can be 
formed only if the number of columns of A is 
equal to the number of rows of B. Since M has 
two rows and three columns, the number of its 
rows is not equal to the number of its columns. 
Thus the product MM cannot be formed. 


The number of rows of N is equal to the 
number of columns of N, so the product NN is 
defined. We have 


s Gee a) 


(c) A matrix A must have the same number of 
rows as columns in order for the product AA to 
be defined. So only matrices that have the same 
number of rows as columns can be squared. 
(You might remember that such matrices are 
called square matrices.) 


Solution to Activity 12 


o G)-GIGA-G ¥ 


2 


(b) 


From the text preceding this activity, we know 


that A’ = G J Therefore 


A® = AA? = (ae? 
_ (20 7\*_ (20 7\ (20 7 
~\o1 6) Zla 6) la 6 


_ (547 182 
= (546 183)" 


Solution to Activity 14 


(a) 


Inputting 1 litre of water at node U gives 
0.2 litres of water output at node C and 
0.8 litres of water output at node D. 


Inputting 2 litres of water at node V gives 
2 x 0.6 litres of water output at node C and 
2 x 0.4 litres of water output at node D. 


So, in total, we have 0.2 + 2 x 0.6 = 1.4 litres of 
water output at node C, and 0.8 + 2 x 0.4 
= 1.6 litres of water output at node D. 


Using a method similar to that of part (a) gives 
0.2x + 0.6y + 0.32 litres of water output at 
node C and 0.8x + 0.4y + 0.7z litres of water 
output at node D. 


Solution to Activity 15 


(a) 


If £2000 is input at node D, then the output at 
node J is 0.2 x £2000 = £400. 


Similarly, the output at node K is 
0.8 x £2000 = £1600. 


If the input at node D is £2400, at node FE is 
£1700, and at node F is £1100, then the 
output (in £) at node J is 

0.2 x 2400 + 0.3 x 1700+ 0.5 x 1100 = 1540, 
and at node K is 

0.8 x 2400 + 0.7 x 1700 + 0.5 x 1100 = 3660. 
If the input (in £) at node D is x, at node E 


is y, and at node F is z, then the output (in £) 
at node J is 


0.2% + 0.3y + 0.52, 
and at node K is 
0.82 + 0.7y + 0.52. 


Solutions to activities 


Solution to Activity 16 


(a) 


(b) 


Since there are three input nodes and two 
output nodes, the network can be represented 
by a 2 x 3 matrix. 


The number that labels the arc from the first 
input node (U) to the first output node (C) 
is 0.2, and this goes in the first row and first 
column. The number that labels the arc from 
the second input node (V) to the first output 
node (C) is 0.6, and this goes in the first row 
and second column. 


Proceeding similarly for all the inputs and 
outputs of the network gives the following 
matrix: 


U V W 
C /0.2 0.6 0.3 
D\0.8 0.4 0.7) 
The vector that represents an input of 1 litre of 
water at nodes U, V and W is 


1 
1 
1 


The output vector for the input vector found in 
part (b) is 


0.2 0.6 0.3 ; 
0.8 0.4 0.7 1 


— (023x1+06x1+03x1I\ _/11 
~\08x14+04x14+07x1)~ \19)° 


This vector represents an output of 1.1 litres of 
water at node C and 1.9 litres of water at 
node D. 
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Solution to Activity 17 (d) Labelling the matrix that represents the 
The matrix that represents the network with combined network gives 
nodes U, V, P and Q is U: yV 
o y A (03 03 
P (05 02 B {0.2 0.14 
Q & o . C \0.5 0.56 
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A simplified network equivalent to the original one 
The matrix equation giving the output : for is given below. 


: u\. 
the input @ is 

p\ (0.5 0.2) fu 

q} \0.5 0.8) Ww)’ 
The matrix that represents the network with 
nodes P, Q, A, B and C is 


P Q 
A /0.3 0.3 
B 10.3 0.1 
C \0.4 0.6 
a 
The matrix equation giving the output | b | for 
c (e) An input of 1 litre at node U and 2 litres at 
the input p ) is node V is represented by the input vector (3). 
Using the equation for the combined network 
a 0.3 0.3 found i h di . 
i= los 04 a l oun = part (c), the corresponding output is 
c 0.4 06) ^1 Ai 
a 0.3 0.3 1 0.9 
a b| = {0.2 0.14 (3) = | 0.48 
The matrix equation giving the output | b | for c 0.5 0.56 1.62 
% Therefore the output is 0.9 litres at node A, 
an input i ii thecombined network is 0.48 litres at node B and 1.62 litres at node C. 
obtained by using parts (a) and (b) as follows. Solution to Activity 18 
a 0.3 0.3 p (a) The matrices (with row and column headings) 
c 0.4 0.6) `I D E F JI K 
0.3 0.3 J {0.2 0.3 0.5 R/0.6 0.9 
: 2 
AOS GIL e on ee 0.7 is) om G ‘eo 
0.5 0.8 v 
0.4 0.6 respectively. 
0.3 0.3 0.5 0.2 i (b) The matrix that represents the combined 
= 0.3 0.1 0.5 0 v network is 
0.4 0.6 0.6 0.9\ /0.2 0.3 0.5 
0.3 0.3 0.4 0.1 0.8 0.7 0.5 
u 
= [0.2 0.14 @ = (0.84 0.81 0.75 
0.5 0.56 ~ (0.16 0.19 0.25)" 


(c) The simplified network is shown below. 


(d) If the input vector (in £) to the combined 
network is 
1700 
1400 | , 
0 


then the output vector (in £) is 
0.84 0.81 0.75 a _ (2562 
0.16 0.19 0.25 0 ~ \ 538 ]° 
So £538 is spent in region S. 


Solution to Activity 19 


The products AI, IA and IB are defined, and they 
give the following results: 


(JG J-C Des 
aG G J J 
m-( YEI -GI 9-8 


The product BI is not defined i B has 
3 columns and I has 2 rows. 


Solution to Activity 20 
We have 


Solutions to activities 


Also, 
1 0 0 x x 
IB=/0 1 0 yJ=ly 
0 0 1 z zZ 


Therefore AI = A and IB = I, as required. 


Solution to Activity 21 


w-()(3 DGN 


2 3 1\ /0 1 ~ 
b) CD=[-1 1 0] ]0 2 -1 
—2 1°0/ U -8 5 
100 
={0 1 0], 
001 
0 1 -~l 2 3 1 
DC={0 2 -1]{-1 1 0 
1 -8 5/ \-2 1 0 
100 
={0 10 
001 


Solution to Activity 22 


(a) We have det A=13x2-—5x5=10. 
Thus A is invertible and 


-1_{ 2-5 
i a = 


(b) We have det B = 3 x 2 — 0 x (— 
Thus B is invertible and 


SONP 
ëls 3/7 3 


(Either of these two forms of the inverse is 
acceptable as your final answer.) 

(c) We have det C = —3 x (—4)- 4 x5 = 0. 
Thus C is not invertible. 


(d) We have det D = 1.5 x 1.5 — 2.5 x 0.5 = 1 #0. 
Thus D is invertible and 


4. fib -25 
a n a 


5)=6 #0. 
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Unit 9 Matrices 


Solution to Activity 23 


Using the property of matrix products given in the 
hint, we have 


AB (4 b 1 d —b 
~ \e dj) \ad—bc \-c a 
ae. a b d —b 
~ ad—be\c d) \-c a 
E 1 ad — bc 0 
~ ad—be 0 ad — bc 


a 1 0 
~ ad — be mg be) € t) 


= {) =I 


Moreover, 


sa- (zhi DE 


oil ad — be 0 -I 
~ ad — be 0 ad—be} ~~ 


Hence AB = BA =I, and it follows that B = A7!. 


Solution to Activity 25 
(a) The coefficient matrix of the equations 
2x+3y=3 
zx + 4y = -1 


is (; A! Thus, the matrix form of the 


equations is 


TORS] 


(b) First rearrange the first equation as 


2x = 6. 

The matrix form of the equations 
2x =6 
3x + 6y = 15 


is 


2 0\/zr\ (6 
3 6) \y)~ \15)° 
(c) The matrix form of the equations 
3x = $y =18 


se + sy =—l 
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GN 
ou ooo 


A G) 7 (5) 


Solution to Activity 26 


(a) 


Activity 25(a) gives the matrix form of the 
equations as 


2 3\/z\_/ù/ 3 
1 4) \y] \-1/)° 
The determinant of the coefficient matrix is 


2x4—-3x1=5, 


so the inverse of the coefficient matrix is 


1/4 -3\ _ 
5 \-1 2) 


Hence 


The solution is x = 3, y = —1. 


Activity 25(b) gives the matrix form of the 
equations as 


2 O\ /x\ [6 
3 6) \y/ M15)’ 
The determinant of the coefficient matrix is 


2x6—-0x3=12, 


so the inverse of the coefficient matrix is 


CPU 9) 


Hence 


(7) =al 2) Gs) =() 


The solution is z = 3, y = 1. 


ele Nle 


Activity 25(c) gives the matrix form of the 
equations as 


[ a (5) E ( D 

4 3 T ‘ 

5 5/ W 1 

The determinant of the coefficient matrix is 


px- Axt 


so the inverse of the coefficient matrix is 


ow oe 


le 
(3-9) 


The solution is x = 10, y = —15. 


Solution to Activity 27 
(a) The matrix form of the equations is 
G 2)G)-G) 
—1 4 y 7). 
The determinant of the coefficient matrix is 
5 x 4— (-3) x (-1) = 17. 
This is non-zero, so the coefficient matrix is 


invertible and the equations have a unique 
solution. 


(b) The matrix form of the equations is 
(253) G) = Gs) 
—9 3 y 18 }° 
The determinant of the coefficient matrix is 
3 x 3—(-1) x (-9) =0. 


This is zero, so the coefficient matrix is 


non-invertible and the equations do not have a 


unique solution. 


Solution to Activity 28 


(a) The matrix form of the equations is 


2 2 3 x 2 
23 4|[y]=[-1 
4 7 10 Z 3 
We have 
sa gy + 2a 
23 4 = | -2 4 -l 
4 7 10 1 -8 1 
Therefore 
z 1 4 -4 2 0 
yļ=ļ|-2 4 -1 -1 | = | —11 
z 1 -3 1 3 8 


The solution is xz = 0, y = —11, z = 8. 


(b) The matrix form of the equations is 


3 2 1\ /2 1 
43 1ıļlf{yļ=12 
7 5 IJ œ 1 


Solutions to activities 


We have 
3 2 1\ = 2 l 
43 1 =|-3 4 —1 
7 5 1 1 1 -l 

Therefore 
£ 2 -—3 1 1 
yļl=ļ-3 4 Sil jele| 4 
Z 1 1 —1 1 

The solution is z = —3, y= 4, z = 2. 


(Details of how to use the CAS to find the 
inverses of the matrices are given in the 
Computer algebra guide, in the section 


‘Computer methods for CAS activities in 
Books A—D’.) 


N 


Solution to Activity 29 


(a) 


(b) 


The matrix form of the equations is 


1 2 3\ /zx 5 
23 1) [y)=(6 
4.7 TJ \z 1 


The determinant of the coefficient matrix is 
zero, so the coefficient matrix is non-invertible 
and the equations do not have a unique 
solution. 


The matrix form of the equations is 


2 2 3\ (£2 5 
23 1) ly)= [6 
4 7 T) \z 1 


The determinant of the coefficient matrix is 14. 
This is non-zero, so the coefficient matrix is 
invertible and the equations have a unique 
solution. 


The matrix form of the equations is 


1 1 —1 1 T 0 
1-1 1 iļfy| l2 
2 0 3 -1 wi] |1 
6 2 4 0 z 0 


The determinant of the coefficient matrix is 
zero, so the coefficient matrix is non-invertible 
and the equations do not have a unique 
solution. 


(Details of how to use the CAS to find the 
determinants are given in the Computer algebra 
guide, in the section ‘Computer methods for 
CAS activities in Books A—D’.) 
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